Medical Physics
○ Wiley
Preprints posted in the last 30 days, ranked by how well they match Medical Physics's content profile, based on 14 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.
Zelter, A.; Riffle, M.; Merrihew, G. E.; Mutawe, B.; Shulman, N.; Sanders, J. A.; Noble, W. S.; Johnson Erickson, D. P.; Morimoto, A.; Shaver, B. A.; Steins, T. N.; Cao, N.; Ford, E. C.; Rudnick, P. A.; Chelsky, D.; Wan, K. H.; Inman, J. L.; Chang, H.; Snijders, A. M.; Mao, J.-H.; Celniker, S. E.; De Chant, J.; Obst-Huebl, L.; Nakamura, K.; Wu, C. C.; MacCoss, M. J.
Show abstract
Ionizing radiation induces molecular responses that may be used to estimate exposure when physical dosimeters are unavailable. Here we present two large-scale proteomics datasets generated from mouse dorsal skin punch samples collected following controlled X-ray exposures spanning multiple doses, dose rates, and post-exposure time points. Experiment 1 comprised 96 samples (including 16 reference samples) collected 6 days after exposure to 0-75 cGy delivered at either 30 or 300 cGy/min. Experiment 2 comprised 936 samples (including 236 reference samples) exposed to 0-100 cGy at either 3 or 28 cGy/min dose rates and harvested between 7 and 150 days post-exposure. All samples were processed using a standardized workflow involving automated bead-based digestion and data-independent acquisition mass spectrometry. The datasets include multiple pooled reference sample types, process controls, and system suitability standards ensuring high quality data. All data presented are available via ProteomeXchange at several levels of processing, from raw files through normalized peptide- and protein-level abundance matrices suitable for biomarker discovery and machine learning applications. This dataset will facilitate generation of new insights into the biological changes and molecular signatures resulting from X-ray exposure in mice and may also help inform future studies in humans.
Rudi, G.; Vula, F.; Bicaku, A.; Dedushi, K.; Ahmetgjekaj, I.
Show abstract
Computed tomography is the largest contributor to population radiation dose from medical imaging, yet no diagnostic reference levels (DRLs) have been published from Kosovo or the Western Balkans. This retrospective audit analyzed all CT examinations performed on a 128- slice scanner at the University Clinical Centre of Kosovo between January and March 2026. After exclusions, 1,535 acquisitions from 1,092 patients across nine examination categories were analyzed. Local DRLs were defined as the 75th percentile and compared against German (BfS 2022) and Turkish (Kahraman et al., 2024) reference values. Head CT (n = 590) demonstrated CTDIvol 4.7% below the BfS DRL yet scan length 98.5% above the orientation value (median 25.8 vs 13 cm). Abdomen-pelvis CTDIvol matched the BfS reference while scan length exceeded it by 28%. Coronary CTA showed CTDIvol +377%, consistent with retrospective ECG gating. Excess scan length, not CTDIvol, is the major driver of elevated dose at this institution. The identified excesses are correctable through technologist landmarking training, protocol review, and enabling iterative reconstruction.
Ge, Y.; Li, E. J.; McDonald, S.; Geagan, M.; Parma, M. J.; Gao, M.; Mei, K.; Pasyar, P.; Im, J. Y.; Muller, F. M.; Pantel, A. R.; Karp, J. S.; Noel, P. B.
Show abstract
BackgroundRealistic PET/CT phantoms are essential for system evaluation, protocol optimization, and validation of advanced reconstruction methods. However, existing phantoms are often limited by simplified geometries, spatially uniform activity patterns, and complex preparation procedures. PurposeTo develop and evaluate PixelPrintPET, a 3D printing-based method for fabricating anatomically realistic PET/CT phantoms with spatially heterogeneous radiotracer distributions and a single-solution filling workflow that avoids physical compartmentalization. MethodsPixelPrintPET generates voxel-based printing instructions that encode spatially varying infill, which is realized during printing through modulation of filament extrusion, enabling heterogeneous activity distributions without compartmentalization of radioactivity at different activity concentrations. Calibration phantoms and anatomically structured phantoms were designed and printed using high-flow polylactic acid (PLA), with anatomical inputs derived from either digital atlas-based models or patient imaging data. The printed phantoms were subsequently filled by immersion in a radioactive solution, allowing activity distribution to be controlled by the internal porous structure. A bottom-up filling procedure with reduced surface tension was developed to ensure uniform infiltration and minimize air entrapment. Phantoms were imaged on the PennPET Explorer PET/CT system, and quantitative performance was evaluated using contrast recovery coefficient (CRC), target-to-background ratio (TBR), and comparisons with simulated or patient-derived reference data. ResultsA strong linear relationship between infill ratio and normalized signal (R2 = 0.998) was demonstrated by the calibration phantom, enabling reliable mapping between structure and activity. Additionally, air entrapment was minimized to less than 1% of the total phantom volume. In the contrast recovery phantom, CRC values were consistent with measurements using traditional phantoms. The brain phantom reproduced atlas-derived contrast patterns, with gray-to-white matter differences within 5% after accounting for resolution and other system effects. The patient-based thorax phantom showed high reproducibility across repeated scans, with differences within 3%, and closely matched the input patient image with regional differences within 10% in all regions except the lung. ConclusionsPixelPrintPET enables the fabrication of realistic, reproducible, and versatile PET/CT phantoms with a voxel-level control of the activity distribution. This approach provides a practical solution for generating patient-specific and application-specific phantoms, with the potential to accelerate system validation, protocol development, and clinical translation of advanced PET/CT technologies.
Killekar, A.; Shanbhag, A.; Miller, R. J.; Dey, D.; Bourque, J.; Phillips, L.; Chareonthaitawee, P.; Slomka, P.
Show abstract
BackgroundPrevious studies evaluated large language model (LLM) performance on the American Society of Nuclear Cardiology (ASNC) Board Preparation Exam. Without domain-specific context, the best model (GPT-4o) achieved 63.1%, below the estimated 65% passing threshold and the 78% mean score of human fellows-in-training (FITs). Providing textbook context improved GPT-4o to 73.8% on text-only questions, but still fell short of human trainees. Whether next-generation LLMs with retrieval-augmented generation (RAG) can exceed this gap is unknown. MethodsClaude Opus 4.7 and GPT-5.5 were administered all 168 questions (141 text-only, 27 image-based) from the 2023 ASNC Board Preparation Exam across 5 iterations each, using RAG with a nuclear cardiology textbook, companion atlas, and ASNC clinical guidelines. Claude used local FAISS-based semantic retrieval; GPT-5.5 used Azures cloud-hosted vector store. Performance was compared to prior LLM results and 13 human FITs. ResultsAcross 5 iterations, Claude Opus 4.7 achieved a mean accuracy of 86.3% {+/-} 1.4% (text 88.8%, image 73.3%). GPT-5.5 achieved 86.7% {+/-} 2.2% (text 88.5%, image 77.0%) but refused a mean of 12.2 questions (7.3%) per iteration due to safety filters. Both models surpassed the human FIT mean (78.0%) and the estimated passing threshold. Compared to GPT-4o without context (63.1%), this represents a 23-percentage-point improvement in 18 months. ConclusionNext-generation LLMs with RAG now surpass average human trainee performance on nuclear cardiology board preparation questions, suggesting significant potential as educational tools and knowledge-reference aids in cardiovascular imaging. Condensed AbstractAcross 5 iterations each, Claude Opus 4.7 and GPT-5.5 with retrieval-augmented generation achieved mean accuracies of 86.3% and 86.7% on the 2023 ASNC Board Preparation Exam (168 questions), both surpassing the mean human fellow-in-training score of 78%. GPT-5.5 refused a mean of 12.2 questions (7.3%) per iteration due to safety filters. These results represent a 23-percentage-point improvement over the best prior LLM without context (63.1%), demonstrating that RAG-enhanced LLMs have reached human-level proficiency in nuclear cardiology knowledge. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=111 SRC="FIGDIR/small/26352768v2_ufig1.gif" ALT="Figure 1"> View larger version (49K): org.highwire.dtl.DTLVardef@5f2465org.highwire.dtl.DTLVardef@4e80d3org.highwire.dtl.DTLVardef@1ebbb93org.highwire.dtl.DTLVardef@167d3c1_HPS_FORMAT_FIGEXP M_FIG C_FIG Overview of the three-study research arc evaluating LLM performance on the 2023 ASNC Board Preparation Exam. Study 1 (2024) tested four LLMs without context (best: GPT-4o, 63.1%). Study 2 (2025) added textbook context to GPT-4o (73.8%). Study 3 (2026, current) evaluated Claude Opus 4.7 and GPT-5.5 with retrieval-augmented generation across 5 iterations each (mean 86.3% and 86.7%, respectively), both surpassing the human fellow-in-training mean of 78%. Right panel shows the performance scale with key thresholds.
Liu, L. P.; Gurevich, A.; McClung, G.; Itkin, M.; Noël, P. B.
Show abstract
PurposeImaging of the central lymphatic system enables characterization of patient-specific lymphatic anatomy and accurate localization of leaks. Advancements in CT technology, particularly spectral CT, can enhance CT lymphangiography (CTL) with improved visualization and quantification. This study aimed to assess the feasibility of spectral CTL in both static and dynamic scans. Materials and Methods50% diluted iodinated contrast was injected into the bilateral superficial inguinal lymph nodes of a pig. The pig was scanned with a dual-layer spectral CT every 60 seconds for 10 minutes. To optimize contrast and visualize peristalsis, a second animal was injected with 25% and 10% diluted contrast and scanned dynamically 4 and 6.25 minutes after contrast injection. Conventional images and iodine maps were reconstructed to calculate the contrast-to-noise ratio (CNR). Additionally, the iodine density was measured adjacent to the lymphovenous junction to show fluctuations from peristalsis and contrast washout. ResultsIodine maps, compared to conventional images, separated the contrast-filled central lymphatic system from surrounding soft tissue and increased CNR to 895 compared to 43 with conventional images. 25% diluted contrast provided the best balance between visualization and quantification of the central lymphatic system, showing high and low iodine density regions corresponding to peristalsis. Iodine density peaked at 15.4 {+/-} 0.6 mg/mL and decreased to 2.0 {+/-} 0.1 mg/mL at 10.5 minutes. ConclusionSpectral CTL not only improves visualization of the central lymphatic system compared to CTL but also provides quantitative information for physiological characterization of lymphatic disease that can enhance current subjective assessment. Research highlights- Iodine maps from spectral CT lymphangiography separated contrast-filled lymphatic structures from surrounding soft tissue and provided better contrast-to-noise compared to conventional images. - Spectral CT lymphangiography enabled quantification of contrast in the central lymphatic system that demonstrated contrast washout and may be utilized for physiological characterization of disease. - Dynamic spectral CT imaging of the lymphatic system visually showed peristalsis in the thoracic duct and was further reflected in quantitative iodine density measurements.
Rashed, M.; Alabdulrahman, H.
Show abstract
Background Automated pelvic CT segmentation has advanced to reliable coarse bone extraction. Yet the structured anatomical hierarchy required for morphometry, fixation planning, bone quality mapping, and arthroplasty workflows remains unachieved. This study developed and validated a fully automated anatomy-informed pipeline that converts standard pelvic CT into a comprehensive, surgeon-readable subsegmentation of the pelvis and proximal femur. Methods Pelvic CT datasets were retrospectively collected from anonymized archives of hospitals affiliated with the Directorate of Health Affairs, Sharqia, Egypt. After eligibility screening, 757 normal adult cases were processed using a custom one-click 3D Slicer pipeline integrating TotalSegmentator for coarse extraction, followed by deterministic anatomy-based subsegmentation into 81 segments. One hundred randomly selected cases were validated against expert-corrected reference segmentations using Dice similarity coefficient, volume difference, surface distance metrics, and bilateral symmetry analysis. Results Of 1,316 screened cases, 757 met eligibility criteria. Across 8,100 case-segment observations, the pipeline achieved a mean Dice of 0.9926 +/- 0.0465. Complete agreement was observed for the sacrum, ilium, acetabulum, anterior and posterior columns, sciatic buttress, and all landmarks. Relative decreases were confined to boundary-dependent regions. Bilateral symmetry analysis confirmed a median surface agreement of 99.85% within 5 mm. Conclusion The pipeline demonstrated high accuracy and reproducibility across a large normal adult dataset, establishing a structured anatomical foundation for quantitative pelvic analysis and surgical planning workflows. Clinical feasibility across abnormal anatomy and decision-level applications awaits dedicated validation.
Wei, J.; Abdollahi, A.; Knoll, M.; Furkel, J.
Show abstract
Background and purposePrecise manual annotation of the left ventricular myocardial (LVM) wall is essential for cardiac substructure research, wall-specific radiation dosimetry, and segmentation model development. However, existing radiotherapy-oriented atlases and conventional CT viewing planes lack an explicit framework for reproducible, wall-level LVM delineation. To address this gap, we developed an anatomy-guided manual segmentation protocol for delineating the five LVM walls on non-contrast-enhanced CT (NECT) or contrast-enhanced CT (CECT) scans. Materials and methodsThis protocol was developed using 60 chest CT scans from two prospective cohorts at Heidelberg University Hospital, including 50 CECTs from IMRT-MC2 cohort and 10 NECTs from MAGELLAN cohort. Manual contouring was performed in 3D Slicer. Segmentation rules were established through review by a radiation oncologist and a cardiology expert, based on the American Heart Association 17-segment model, and were tested on additional CT scans before final protocol definition. ResultsThe protocol centers on three geometric steps: (1) defining the LV long axis using the endocardial apex and the center of the mitral annulus; (2) constructing an apical delimitation plane based on LV geometry; and (3) partitioning wall regions via intersections of the right ventricular and LV cavity centers in the short-axis view. This workflow enables structured segmentation of the anterior, septal, lateral, inferior, and apical LVM walls, supporting anatomically coherent 3D reconstruction. ConclusionThis study provides contouring steps and a representative atlas as a methodological basis for standardized annotation, with potential applications in dose-mapping cardiotoxicity analysis and deep-learning modeling for radiotherapy.
Kölmel, E. G.; Otero-Casal, P.; Pardo-Montero, J.
Show abstract
Clinical studies of prostate cancer treated with radically hypofractionated highdose-rate brachytherapy (HDR-BT) have reported a significant loss of tumor control that contradicts the standard linear-quadratic (LQ) and low /{beta} ratio paradigm for prostate cancer. In a previous study by our group, we showed that the linear-quadraticlinear (LQL) model could describe this response, but the underlying biological drivers remained unclear. In this follow-up study, we further investigate whether the interplay between hypoxia and reoxygenation kinetics can explain the poor response to extreme hypofractionation. We analyzed a large dataset of 3,239 patients (44 schedules) using a three-compartment reoxygenation model (the MSK model) that simulates the dynamics of oxic, intermediate, and hypoxic cell populations. Results show that the MSK model achieves an excellent fit to the clinical data (p > 0.99) while maintaining a biologically plausible low /{beta} ratio ([≤] 8 Gy). The reoxygenation model provided a performance comparable to the LQL model for low-risk prostate cancer, being slightly inferior to the LQL model to describe the response of intermediate-risk. This suggests that the observed reduction in tumor control is not necessarily a failure of the LQ formalism, but rather a consequence of oxygen dynamics associated with ultra-fractionated schedules, and provides a mechanistic basis for designing clinical trials exploring the response of prostate cancer to ultra-hypofractionation and the role of reoxygenation.
Haluptzok, T. D.; Sadeghi-Tarakameh, A.; Lagore, R. L.; Metzger, G. J.
Show abstract
PurposeTo address the limitations of single-distance, 1D performance metrics in RF coil design. This work introduces a multi-objective, volume-of-interest (VOI) based analysis to systematically characterize the trade-offs between power efficiency, pSAR efficiency, and homogeneity as a function of dipole length (l) and distance-to-load (d) for multiple dipole geometries and target anatomies. MethodsElectromagnetic simulations of straight and end-meandered dipole antennas were performed with varying lengths (100-500 mm) and distance-to-load (1-81 mm) over three anatomical targets (prostate, kidney, heart). Homogeneity, power efficiency, pSAR efficiency, and load sensitivity performance metrics were calculated within each anatomical VOI. Inter-element coupling at variable d was assessed in a 3-element array, and a subset of single-element simulations was experimentally validated using B1+ mapping. ResultsA fundamental trade-off was found between power efficiency and pSAR efficiency. Optimal power efficiency was achieved with shorter dipoles (150 mm < l < 300 mm) closer to the sample (d < 30 mm), while optimal pSAR efficiency and homogeneity were achieved with longer dipoles at further from the sample (d > 60 mm). Inter-element coupling increased with distance-to-load but could be managed by increasing element spacing. Experimental measurements were in good agreement with simulation trends. ConclusionIncreasing distance-to-load to 40-60 mm, compared with commonly used distances of 20-30 mm, offers a practical strategy for improving pSAR efficiency and homogeneity with a minimal decrease in power efficiency. This work provides a quantitative analysis that enables RF coil designers to make informed, data-driven decisions when developing next-generation body arrays and suggests that unshielded end-meandered dipoles could be an optimal transmit element geometry.
Yang, J.; Li, L.; Cao, J.; Zhang, J.
Show abstract
Objective:This study aims to compare the advantages and disadvantages of DLIR and adaptive statistical iterative reconstruction-V (ASIR-V) in thin-slice (2.5 mm) CT images of hepatic lesions characterized by high and low contrast. Additionally, the study seeks to determine the optimal DLIR strength for the evaluation of liver lesions. Methods:A retrospective analysis was performed on 90 patients who underwent abdominal contrast-enhanced CT scans. Group A comprised 48 patients with low-contrast lesions, while Group B included 42 patients with high-contrast lesions. The acquired images were reconstructed using post-processing DLIR at low (DLIR-L), medium (DLIR-M), and high (DLIR-H) strengths, all with a slice thickness of 2.5 mm (subgroups A1-A3, B1-B3). Furthermore, images were reconstructed with ASIR-V at 50% strength at slice thicknesses of 2.5 mm and 5 mm (subgroups A4/B4 and A5/B5, respectively). CT values and standard deviations (SD) of the liver and lesions were measured, and the corresponding signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) were calculated. The edge rise slope (ERS) was determined using ImageJ software by measuring CT values along a line from the liver parenchyma to the lesion. Objective metrics were compared using one-way ANOVA, with independent samples t-tests applied for inter-group differences. Subjective scoring, which encompassed noise level, diagnostic confidence, and lesion margin delineation, was conducted by two radiologists, with differences analyzed using the Kappa test. Results: Objective evaluation revealed a progressive decrease in lesion SD and a progressive increase in SNR and CNR from subgroups A1/B1 to A3/B3. The SD of Group A2 decreased by 57.4% compared to A4, while the SNR and CNR of A2 icreased by 19.3% and 24.6% compared to A4. Although subgroup B2 had a lower SNR than B5, the difference was not statistically significant. SNR and CNR in B2 increased by 24.1% and 11.9%, respectively, compared to B4. ERS gradually decreased from A1/B1 to A3/B3. ERS values in A2 and B2 increased by 27.0% and 39.4%, respectively, relative to A5 and B5. Although A3 had a lower ERS than A1 and A2, all DLIR subgroups exhibited higher ERS than A5; similar trends were observed in Group B. Subjective evaluation indicated good inter-reader agreement (Kappa > 0.61, p < 0.05). As DLIR strength increased, noise scores rose progressively in both groups. However, noise in A2 and B2 was lower than in A4/A5 and B4/B5. Diagnostic confidence and lesion margin delineation scores were highest in A2 and B2, while all subjective scores were lowest in A5 and B5. Discussion: Most prior studies evaluated the liver, vessels, or confirmed that image quality can be guaranteed at low doses. However, there are few studies on specific individual lesions. Therefore, this study aims to investigate specific individual lesions. The details and detection rate were analyzed separately to confirm the clinical acceptability of 2.5-mm DLIR image in different contrast lesions. Conclusion: For both high- and low-contrast hepatic lesions, DLIR provides superior image quality compared to ASIR-V, with the 2.5mm DLIR-M setting being optimal. DLIR-M reduces image noise, improves spatial resolution, and produces images more suitable for diagnostic purposes.
Fatima, S.; Notnani, A.; Chaurasia, R. K.; Shirsath, K. B.; Khan, A.; Kumar, D.; Sapra, B. K.
Show abstract
PurposeLow-dose radiation-induced adaptive response (LDRIAR) is well documented, but its role in early DNA damage signalling remains unclear. This study aimed to investigate whether adaptive response influences initial DNA double-strand break (DSB) recognition, as reflected by {gamma}H2AX foci formation, and to evaluate its time-dependent expression in human lymphocytes. Materials and MethodsPeripheral blood lymphocytes from three healthy donors were exposed to a priming dose followed by a challenging dose at defined time intervals. DNA damage was assessed using {gamma}H2AX foci analysis, comparing acute and split-dose exposures in both PHA-stimulated (large) and non-stimulated (small) lymphocytes. ResultsA clear time-dependent adaptive response was observed. No significant reduction in {gamma}H2AX foci was detected at 1 h (p > 0.05). At 2 h, a significant decrease was observed ([~]7-8% in large and [~]13% in small lymphocytes; p < 0.01), which increased at 4 h ([~]12% and [~]22%, respectively; p < 0.001). The maximal response occurred at 15 h, with reductions of [~]40- 43% in large and [~]27% in small lymphocytes (p < 0.001). Small lymphocytes exhibited an earlier response, while large lymphocytes showed a greater magnitude at later time points. The temporal trend was consistent across donors, with minor variability at later intervals. ConclusionsThe findings demonstrate that LDRIAR is reflected at the level of DNA damage signalling and follows a defined temporal pattern with cell-type specificity. This suggests that adaptive response may influence early DSB-associated processes, contributing to a better understanding of radiation response mechanisms in radiobiology.
Yusufaly, T.; Transtrum, M.; Huang, L.; Sabok-Sayr, S.; Sgouros, G.; Hobbs, R.; Jia, X.
Show abstract
Developing parsimonious, mechanism-aware quantitative models that predict how biological effectiveness changes with different modifiers remains, in general, an unsolved problem. Advances in radiobiological research have created a large knowledge base of first-principles mechanistic models of radiation response that, in principle, could accurately predict radiosensitivity across different experimental and clinical conditions. However, in practice these mechanistic models come with an overabundance of parameters, the majority of which are practically unidentifiable and, moreover, likely unnecessary if one simply wishes to predict how radiosensitivity changes for some specific modifier of interest. Nevertheless, determining which few details in the full mechanistic model are relevant for a given purpose, as well as how to remove any other extraneous details, remains a highly non-trivial task. In this study, we demonstrate the potential of model reduction, starting from a detailed mechanistic description, as a systematic strategy for deriving parsimonious, experimentally falsifiable radiobiological descriptors. As a proof-of-concept demonstration, we apply the Manifold Boundary Approximation Method (MBAM) to a Mechanistic Model of DNA Repair and Survival (MEDRAS), for the problem of cell survival prediction following an acute exposure. Our findings reveal that the complete MEDRAS model for an arbitrary mixed-quality exposure can be structurally simplified to a reduced three-parameter model for an effective uniform-quality, named MEDRAS-LPL. Additional MBAM analysis on MEDRAS-LPL identifies two boundaries in parameter space, corresponding to sparsely ionizing and densely ionizing radiation. Mapping of MEDRAS-LPL parameter space on to effective LQ space further demonstrates that parameters close to the sparsely ionizing boundary line up with expectations from the theory of dual radiation, while parameters close to the densely ionizing boundary line up with expectations from a purely linear model based on a target-theory description. Moreover, our formalism predicts enhanced synergistic interactions between sparsely ionizing and densely ionizing radiation beyond the Zaider Rossi model (ZRM) paradigm, in line with empirical observations. The results highlight the potential for using reduced-order models not only for predictive applications but also for generating novel hypotheses that can inform future experimental designs and optimization strategies in radiobiology.
Nomura, Y.; Hanaoka, S.; Nakao, T.; Yamagishi, Y.; Kikuchi, T.; Sonoda, Y.; Miki, S.; Oba, K.; Yoshikawa, T.; Abe, O.
Show abstract
ObjectivesTo characterize longitudinal age-related changes in abdominal organ volumes using CT volumetry and to model nonlinear trajectories across multiple organs. Materials & MethodsThis retrospective single-center study included adults who underwent whole-body screening low-dose CT between 2006 and 2017. Subjects with at least eight examinations during a follow-up period of at least 78 months were included. After applying exclusion criteria, 700 participants with 6,739 CT series were analyzed. Non-contrast CT images were processed using automated organ segmentation, and volumes of the liver, pancreas, spleen, and kidneys were quantified. Longitudinal changes were modeled using generalized additive mixed models with sex-specific smooth functions of age and subject-level random effects. Age-dependent rates of change were estimated from model derivatives. ResultsA total of 700 participants (mean age, 56.9 {+/-} 9.8 years, 29.6% women) were evaluated. Liver, pancreas, and kidney volumes showed mild increases or plateaued at approximately 40-60 years of age, depending on the organ, and were followed by gradual declines with advancing age, whereas splenic volume showed a progressive decrease across the age range. These patterns showed nonlinear age dependence. The transition from positive to negative change rates tended to occur earlier in women than in men for several organs, particularly the liver and kidneys. ConclusionLongitudinal CT analysis demonstrated nonlinear age-related changes in abdominal organ volumes, with organ-specific trajectories and sex-related differences in the timing and magnitude of volume changes. QuestionHow do abdominal organ volumes change longitudinally with age, and can their trajectories be characterized for each organ? FindingsLongitudinal CT analysis demonstrated nonlinear, organ-specific volume trajectories, with transitions from stability to decline around 40-60 years and earlier transitions in women than men. Clinical RelevanceLongitudinal reference patterns of abdominal organ volumes on CT improve the interpretation of age-related changes and support more accurate differentiation between physiological variation and disease-related volume alterations.
Kohler, I. A.; Zheng, L.; Kuder, T. A.; Goedicke, O.; Ladd, M. E.; Hesser, J.
Show abstract
Diffusion MRI simulations based on realistic tissue microstructure provide a means to validate biophysical models and optimize acquisition protocols, but their computational cost restricts most studies to domains far smaller than a clinical voxel. The objective of this study was to develop an automated and scalable framework that converts whole-slide histology into diffusion MRI simulations at clinically relevant spatial scales while remaining feasible on standard workstation hardware. We present an end-to-end pipeline integrating two-dimensional whole-slide cell segmentation, mesh generation, and finite element Bloch-Torrey simulation. To enable simulations at large spatial scales without prohibitive memory growth, we introduce a subdomain tiling strategy in which the tissue domain is partitioned into extended subdomains simulated independently under no-flux boundary conditions. Signals are aggregated only from the central regions of each subdomain to minimize boundary artifacts. For an 800 {micro}m x 800 {micro}m histology-based domain, the aggregated signal differed by 0.07% from the corresponding full-domain finite element simulation while reducing wall-clock time from several days to hours and maintaining bounded memory usage independent of global domain size. When applied to a 2016 {micro}m x 2016 {micro}m heterogeneous region approximating the in-plane dimensions of a clinical voxel, the apparent diffusion coefficient obtained from the full domain differed from values computed in smaller dense and sparse subregions, demonstrating the influence of structural heterogeneity at clinically relevant scales on derived diffusion metrics. The proposed framework establishes an automated and memory-stable approach for generating diffusion MRI simulations directly from routine histology.
Ludwig, K. D.; Hatt, C. R.; Keith, L.; Matyga, A. W.; Te, H. S.; Landeras, L.; Chelala, L.; Patel, A. R.; Chung, J. H.
Show abstract
Objective: Coronary artery calcification (CAC) assessment for cardiovascular risk stratification is traditionally achieved using ECG-gated computed tomography (CT). Automated deep-learning (DL) algorithms may streamline opportunistic CAC detection and scoring, particularly on non-gated CT scans. This study evaluated the performance of a fully automated DL-based CAC scoring algorithm ("DL-CAC") against expert human scoring. Methods: The algorithm was trained on 1,260 chest CT scans from multiple databases to automatically identify coronary calcium, calculate Agatston scores, and assign a cardiovascular disease (CVD) risk classification. Performance was assessed on a holdout dataset (n=500) comprising ECG-gated calcium scoring CT scans and lung cancer screening non-gated chest CTs as well as in an external, independent CT dataset (n=129) from liver transplant candidates. Agreement with expert scoring was assessed using intraclass correlation coefficient (ICC) for Agatston scores and Cohen's {kappa} for CVD risk classification. Results: The algorithm demonstrated high agreement with expert scoring in the pooled calcium scoring and lung cancer screening cohorts, with an ICC of 0.947 for Agatston scores and {kappa} of 0.936 for CVD risk classification. For liver transplant candidates, the algorithm exhibited substantial agreement with expert scoring of non-gated CT scans ({kappa}=0.79) and a sensitivity of 90.4% and specificity of 96.4% in high-risk cases. Conclusion: These findings suggest that DL-based CAC scoring on non-gated CT scans may be a feasible alternative to traditional methods and could support opportunistic cardiovascular risk assessment in routine imaging. Further validation is warranted to assess clinical integration in broader practice settings.
Harrison, C. A.; Wu, M.; White, O.; Hopkinson, G.; Hughes, J.; Robertson, S.; Scurr, E.; Shur, J.; Castagnoli, F.; Charles-Edwards, G.; Koh, D.-M.; Winfield, J.
Show abstract
Objectives: AI-based reconstructions can reduce MRI acquisition times and/or improve image quality. Guidelines recommend clinical evaluations and post-deployment monitoring of these novel methods, however, there has been little investigation of the clinical resources required for such assessments. The aim of this study was to evaluate the healthcare resource utilisation and potential savings associated with AI-based reconstructions in rectal MRI. Methods: A retrospective economic costing analysis was conducted from the NHS healthcare perspective. Resource utilisation data were extracted from the Electronic Patient Records for 9 healthy volunteer scans and 104 rectal MRI examinations evaluating an AI-based reconstruction. The resource profile included the MRI scan and the staff time required for data acquisition and analysis. Results: The clinical evaluation of the AI-based reconstruction cost {pound}15,023. Deployment of the AI-based reconstruction reduced the length of an MRI rectum scan by 22 minutes, theoretically saving approximately {pound}3,437 per month. Addition of post-deployment quality control scans reduced this monthly saving to {pound}2,636. If the quality control scans were evaluated using radiologists rather than image quality metrics, monthly savings would be approximately {pound}2,541. With ongoing quality control, the clinical evaluation cost would be recouped between 5.8 and 6 months, compared with 4.4 months without ongoing quality control. Conclusions: Deploying AI-based reconstructions can yield cost savings through reduced scanning times. Quality control tests using image quality metrics would save radiological burden and reduce costs compared with conducting repeated image scoring by radiologists.
Doucette, M.; Zhang, Y.; Liao, C.-Y.; Lin, M.-H.; Yan, Y.; Dess, R. T.; Tendulkar, R. D.; Garant, A.; Hannan, R.; Jiang, S.; Nguyen, D.; Desai, N.; Yang, D. X.
Show abstract
Our study evaluated whether a deep learning auto segmentation model combined with machine learning triage can streamline radiotherapy clinical trial quality assurance (QA). We analyzed 107 stereotactic ablative radiotherapy (SABR) cases from a multi-institutional phase II clinical trial of neurovascular sparing prostate SABR, focusing on physician contours of the internal pudendal artery (IPA) as a novel organ-at-risk with substantial interobserver variability. Contours were scored by the trial principal investigator as Per-Protocol or Minor Deviation/Unacceptable. We applied a deep learning model for IPA auto-segmentation. Agreement between human and AI contours was then quantified using 14 overlap, distance, and surface metrics, and a supervised classifier was trained on these metrics to flag clinical trial protocol deviations. While AI segmentation achieved only modest geometric accuracy with mean Dice similarity coefficient of 0.446 and 95th percentile Hausdorff distance of 14.23, when incorporating all 14 metrics, a machine learning classifier yielded AUROC of 0.836, flagging all Minor Deviation/Unacceptable cases with 100% sensitivity on the 27 case hold-out set with 6 false positives and no false negatives. AI segmentation combined with metrics-based machine learning can triage protocol deviations within a multi-institution radiotherapy clinical trial, supporting prospective evaluation of AI-assisted trial QA.
Hameed, S.; Henry, K.; Jiang, F.; Bhusal, B.; Dillenbeck, H.; Gakenheimer-Smith, L.; Webster, G.; Golestani Rad, L.
Show abstract
Pediatric patients with cardiac implantable electronic devices (CIEDs) face limited MRI access due to RF-induced heating, and computational modeling is increasingly used to characterize this risk. The validity of these simulations, however, depends on pairing body models with clinically realistic lead configurations, guidance that is currently lacking. We retrospectively analyzed 302 CIED surgeries in 281 pediatric patients to derive weight-based constraints for simulation design. Weight alone discriminated epicardial from endocardial lead implantation with AUC = 0.90, and adding age and height yielded no improvement, supporting weight as a sufficient single-parameter selection metric. The probabilistic crossover between approaches occurred at 44~kg, substantially higher than the 10 to 15~kg threshold commonly cited in the literature, with a broad transition zone of 21 to 66~kg in which both lead types were routinely used. Lead length was likewise weight-constrained: only 25~cm leads were observed in patients below 6~kg, and leads of 45~cm or longer were uncommon below 50~kg. These findings yield a three-tier framework, with epicardial-only configurations below 21~kg, dual configurations within 21 to 66~kg, and weight-thresholded lead lengths throughout, enabling MRI safety simulations to focus on clinically realizable anatomy and device combinations.
Maier, C.; Solomon, E.; Verghese, G.; Chandarana, H.; Block, K.-T.; Alon, L.
Show abstract
Purpose: To develop and evaluate a flexible, software-defined radar platform for contactless, vendor-independent motion detection and correction in MRI. Methods: A continuous-wave (CW) Doppler radar was implemented using a software-defined radio and the open-source GNU Radio framework. The system was deployed inside a 1.5T MRI scanner and synchronized with MRI acquisitions. We evaluated the performance in a custom-developed internal motion phantom and in healthy volunteers to track respiration and bulk motion. The radar-derived signal was validated against cine MRI and used to demonstrate both retrospective and prospective motion management techniques in phantom and in healthy volunteers. Results: The radar provided robust motion signals that correlated strongly with image-based ground truth signals in both phantom and volunteer experiments. Signal characteristics were found to be frequency-dependent, enabling optimization for different motion regimes. Retrospective correction of free-breathing abdominal data using the radar signal effectively suppressed respiratory artifacts, achieving image quality comparable to a self-gating approach. Prospective triggering successfully reduced motion artifacts in the phantom study. The system also reliably detected sporadic events such as swallowing during neck imaging. Conclusion: Software-defined radar was demonstrated to be an effective platform for both prospective and retrospective motion correction. Its independence from the MRI system, ultra-wide band capabilities, and body-region versatility enable the adaptation of the technique for a wide range of imaging applications and protocols.
Wu, X.; Zhang, J.; He, Y.; Zhang, Y.; Kang, X.; Hu, W.; Li, Y.; Ma, H.; Wang, Y.; Song, Y.; Chen, X.; Huo, F.; Zhang, Y.; Yin, H.; Xi, Y.
Show abstract
Background: Traditional bone scintigraphy for detecting malignant bone metastases is limited by suboptimal accuracy and radiation exposure. Whole-body magnetic resonance imaging (WB-MRI), while an alternative, requires lengthy scan times and high patient compliance. Purpose: To develop a novel, rapid whole body bone screening (WB-RBS) MRI protocol and evaluate its diagnostic performance for bone metastasis detection. Materials and Methods: Patients with pathologically confirmed malignancies and healthy controls were prospectively enrolled. All participants underwent WB-RBS (acquisition time: about 10 min); patients additionally underwent WB-MRI (about 70 min). Three radiologists, blinded to clinical data, independently evaluated the images for bone metastases. A consensus expert diagnosis served as the reference standard to calculate the diagnostic performance of WB-RBS. Specificity was further assessed in the healthy control group. Results: Seventy patients and 19 healthy controls were included. WB-RBS demonstrated excellent inter-reader agreement at the patient level. Compared with the reference standard, WB-RBS achieved an accuracy of 77.1%-91.4% at the patient level and a slightly lower accuracy (70.6%-82.5%) at the lesion level. At diagnostic confidence thresholds 1-3, the correlations between WB-RBS ratings and the reference standard were statistically significant for both patient- and lesion-level analyses. Conclusion: WB-RBS showed favorable inter-reader agreement and high accuracy for bone metastasis screening at the patient level, while substantially reducing scan time and cost. Its rapid, radiation-free nature and high accessibility offer distinct clinical advantages, supporting its potential as an alternative screening tool to conventional bone scintigraphy.